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BACKGROUND OF THE INVENTION 

The present invention relates to communication systems, and more 
particularly to multiple-input multiple-output wireless systems. 

Wireless communication systems typically use band-limited channels with 
time-varying (unknown) distortion and may have multi-users (such as multiple 
cellphone users within a cell). This leads to intersymbol interference and multi- 
user interference, and requires interference-resistant detection for systems which 
are interference limited. Interference-limited systems include multi-antenna 
systems with multi-stream or space-time coding which have spatial interference, 
multi-tone systems, TDMA systems having frequency selective channels with 
long impulse responses leading to intersymbol interference, CDMA systems with 
multi-user interference arising from loss of orthogonality of spreading codes, high 
data rate CDMA which in addition to multi-user interference also has intersymbol 
interference. 

Interference-resistant detectors commonly invoke one of three types of 
equalization to combat the interference: maximum likelihood sequence 
estimation, (adaptive) linear filtering, and decision-feedback equalization. 
However, maximum likelihood sequence estimation has problems including 
impractically large computation complexity for systems with multiple transmit 
antennas and multiple receive antennas. Linear filtering equalization, such as 
linear zero-forcing and linear minimum square error equalization, has low 
computational complexity but has relatively poor performance due to excessive 
noise enhancement. And decision-feedback (iterative) detectors, such as 
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iterative zero-forcing and iterative minimum mean square error, have moderate 
computational complexity but only moderate performance. 

Thus there is a problem of trade-off of performance and computational 
complexity for multiple-input, multiple-output systems. 

SUMMARY OF THE INVENTION 

The present invention provides a method and detector for multiple-input, 
multiple-output (Ml MO) systems which iteratively detects subsets of symbols and 
thereby allows reduced-state maximum likelihood decisions. 

This has advantages including improved performance with moderate 
increase in computational complexity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1a-1b are flow and functional block diagrams. 
Figures 2a-2c illustrate receivers and transmitters. 
Figures 3a-3d are simulation results. 
Figures 4a-4b show Turbo coding. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
1 . Overview 

Preferred embodiment detectors and detection methods for multi-input, 
multi-output (MIMO) systems partition the set of transmitted symbols into subsets 
and jointly detect the symbols within a subset and iteratively detect the subsets. 
This provides interference cancellation of previously detected symbols. The joint 
detection within a subset may be a linear soft detection followed by maximum 
likelihood (ML) hard decision. Figure 1a is a flow diagram, and Figure 1b 
illustrates functional blocks of a preferred embodiment which partitions of a set of 

P symbol indices, {1 , 2, .... P}, into M subsets, h, h I M , of sizes Pi, P 2 , P M , 

respectively. Thus, P - Zi<m<MPm- The hard estimates for the P m symbols with 
indices in subset I m are computed by a P m -input, P OT -output ML detector. These 
hard estimates are then used to regenerate the signal components associated 
with the symbols whose indices lie in I m , and the regenerated components from 
this iteration plus prior iterations are subtracted from the received signal 
(interference cancellation) for use as input to the next iteration. This use of ML 
detection in reduced states (subsets of indices) provides a range of performance- 
complexity trade-offs previously unavailable. 

Preferred embodiment wireless communication systems incorporate 
preferred embodiment detection methods. Figure 1b illustrates functional blocks 
of a preferred embodiment detector. Figure 2a shows a corresponding receiver 
for a MIMO wireless communications system; and Figure 2b shows a P-antenna 
transmitter for such a system. In such systems components, base stations, and 
mobile users, could each include one or more application specific integrated 
circuits (ASICs), (programmable) digital signal processors (DSPs), and/or other 
programmable devices with stored programs for implementation of the preferred 
embodiment. The base stations and mobile users may also contain analog 
integrated circuits for amplification of inputs to or outputs from antennas and 
conversion between analog and digital; and these analog and processor circuits 
may be integrated on a single die. The stored programs may, for example, be in 
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external or onboard ROM, flash EEPROM, and/or FeRAM. The antennas may 
be parts of RAKE detectors with multiple fingers for each user's signals. The 
DSP core could be a TMS320C6xxx or TMS320C5xxx from Texas Instruments. 



2. Iterative detectors 

First consider iterative detectors generally. Figure 2a illustrates a receiver 
with an interference-resistant detector as could be used in a wireless 
communications system with P transmit antennas (P data streams) and Q receive 
antennas. Figure 2b illustrates a corresponding transmitter with P transmit 
antennas. The received signal in such a system can be written as: 

r = Hs+ w 

where r is the ^-vector of samples of the received baseband signal (complex 
numbers) corresponding to a transmission time n: 

r x {n) 



r = 



r 2 (n) 



r Q (n) 

s is the P-vector of transmitted symbols (complex numbers of a symbol 
constellation) for time n: 

s 2 (n) 

s = 

s p (n) 

H is the QxP channel matrix of attenuations and phase shifts; and w is a g-vector 
of samples of received (white) noise. That is, the (q,p)th element of H is the 
channel (including multipath combining and equalization) from the pth transmit 
source to the qth receive sink, and the qth element of w is the noise seen at the 
qth receive sink. 

Note that the foregoing relation applies generally to various systems with 
various interference problems and in which n, r, s, P, and Q have corresponding 
interpretations. For example: 
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(a) High data rate multi-antenna systems such as BLAST (Bell Labs layered 
space time) or MIMO and multi-stream space-time coding: spatial interference 
suppression techniques are used in detection. 

(b) Broadband wireless systems employing OFDM (orthogonal frequency 
division multiplex) signaling and MIMO techniques for each tone or across tones. 

(c) TDMA (time division multiple access) systems having frequency-selective 
channels with long impulse response which causes severe ISI (intersymbol 
interference). Use equalizers to mitigate ISI. 

(d) CDMA (code division multiple access) systems having frequency-selective 
channels which cause MUI (multi-user interference) as a result of the loss of 
orthogonality between spreading codes. For high data rate CDMA systems such 
as HSDPA and 1xEV-DV, this problem is more severe due to the presence of ISI. 
Equalizers and/or interference cancellation may be used to mitigate these 
impairments. 

(e) Combinations of foregoing. 

Hence, P is essentially the number of symbols that are jointly detected as 
they interfere with one another, and Q is simply the number of collected samples 
at the receiver. Because there are P independent sources, Q must be at least as 
large as P to separate the P symbols. A (interference-resistant) detector in a 
receiver as in Figures 2a-2b analyzes received signal vector r and outputs soft 
estimates x of the transmitted symbols s to a demodulator and decoder. 

Presume that the different symbols which are transmitted via P different 
antennas are uncorrelated and also may utilize different modulation schemes. 
This implies the PxP matrix of expected symbol correlations, A = E[ss H ], is 
diagonal with entries equal the expected symbol energies (k k = E[|s k | 2 ]): 



0 



0 



0 



0 



A = 



0 



0 
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For linear filtering equalization detectors, such as linear zero-forcing (LZF) 
or linear minimum mean square error (LMMSE), the soft estimates, denoted by 
P-vector x, derive from the received signal by linear filtering with PxQ matrix F; 
namely, x = F r. 

In particular, LMMSE detection finds the matrix F by minimizing the mean 
square error, E[||x-s|| 2 ]. With perfect estimation of the channel H, the minimizing 
matrix F is given by : 

F lmmse = [H H H + a 2 A' 1 ]" 1 H H 
= AH H [ HAH H + a 2 l Q T 1 
where a 2 is the variance per symbol of the additive white noise w and Iq is the 
QxQ identity matrix. Note F has the form of a product of an equalization matrix 
with H H which is the matrix of the matched filter for the channel. 
Analogously, LZF detection finds the matrix F by inversion: 

Flzf= [H h H]- 1 H h 
Essentially, LZF detection nulls out the interference at the expense of 
excessive noise enhancement, while LMMSE detection minimizes the sum of 
interference and noise energy to provide a balanced trade-off between bias (due 
to residual interference) and variance (due to noise enhancement). But then 
each component of x is processed separately to obtain a corresponding hard 
symbol estimate. That is, a hard estimate, s„ for the ith symbol, s„ comes from 

application of a hard decision operator to the soft estimate: s, = V{ x, } for i = 1 , 2, 

P, where TJ>{ } is a non-linear decision device, usually a single-input, single- 
output (SISO) maximum-likelihood (ML) detector. 

A typical iterative (decision-feedback) detector (IZF or IMMSE) for blocks 
of P symbols has a series of P linear detectors (P iterations) with each linear 
detector followed by a (hard) decision device and interference subtraction 
(cancellation). Each of the P linear detectors (iterations) generates both a hard 
and a soft estimate for one of the P symbols. The hard estimate is used to 
regenerate the interference arising from the already-estimated symbols which is 
then subtracted from the received signal, and the difference used for the next 
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linear symbol estimation. Hence, the hard symbol estimates, s„ are also 
computed in a disjoint fashion. 

On the other hand, the optimal joint maximum likelihood (ML) detector 
generates the hard symbol estimates as: 



where C = Ci x Ci x ... x C P is the P-dimensional constellation of P-vectors of 
symbols and has size riis/s/> \Q\ where \Q\ denotes the number of elements in 
constellation q. Thus the hard symbol estimates are jointly computed, and the 
computational complexity of this optimal detection is proportional to the 
constellation size which is exponential in P. 

When an error correcting code is employed, soft symbol or soft bit (log 
likelihood ratio) estimates are often desired (see Section 8 for example). While 
soft symbol estimates can be obtained in many ways, the conditional expectation 
estimator is optimal. The conditional expectation estimator is: 



The performance loss of iterative detectors comes from the disjoint processing of 
the soft symbol estimates, x„ to obtain the corresponding hard symbol estimates, 
§,. The preferred embodiment detectors circumvent this problem by partial ML 
detections with moderate computational complexity. 

3. Iterative reduced-state maximum-likelihood detection 

Figure 1a is a flow diagram and Figure 1b shows functional blocks for first 
preferred embodiment detection methods which detect a P-vector of transmitted 
symbols, s, from a received g-vector signal, r, as expressed in the following 
pseudocode which presumes a partition of the set of symbol indices, {1,2,...,P}, 
into M subsets of indices h, I 2 , .... Im, where I m has P m elements. 

initialize r (1) = r 



s M L = arg min seC ||r-Hs|| 2 
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for m = 1 to M 

x (m) = F m r (m) // P m -vector of soft estimates for I m indexed symbols 

s (m) = MLDm { x (m) } // P m -vector of hard decisions for I m indexed symbols 

r (m+i) _ r (m) _ g(m) u j n t er f erence cancellation using hard estimates 

end 

where F m is a P m x Q matrix for the feedforward transformation of received signal 
into soft estimates, x (m) , of the l m indexed symbols; MLD m is a (P m , P m ) maximum 
likelihood (ML) detector; G m is a Q x P m matrix for the feedback transformation 
using the m\X\ iteration output for interference cancellation. The following 
paragraphs describe choices for F m , G m , and MLD m . And note that the preferred 
embodiment methods for trivial index set partitions reduce to the usual methods: 
for M= 1 (no partitioning), the method is just an ML detection for all of the 
symbols, and for M = P (every index is a separate subset), the method is just 
regular iterative detection. 

Zero-forcing and minimum mean square error criteria each define F m and 
G m as follows. Begin with some convenient notation: let J\ = {1,2, P) and 
recursively take J m = J mA - l mA for 2 < m < M, so J m is the set of indices of 
symbols remaining to be detected at the start of the wth iteration. Let H m be the 
Q x P m matrix consisting of the columns of H with index in I m , so H„, is the channel 
matrix for the P m symbols being detected during the roth iteration. Let K m be the 
Q x (Lm<k<MPk) matrix consisting of the columns of H with index in J m , so K m is the 
channel matrix for the symbols remaining to be detected at the start of the mth 
iteration (Ki = H). Let A m be the P m x P m diagonal sub-matrix of A of elements 
with both row and column indices in I m \ consequently, the elements of A m are the 
energies of the symbols to be detected during the mth iteration. Let Jl m be the 
(Z m <k<M A)x(Zmst<w Pk) diagonal sub-matrix of A of elements with both row and 
column indices in J m , so the elements of I\ m are the energies of the symbols 
remaining to be detected at the start of the with iteration. Let 1 OT be the P m xQ the 
matrix of the rows of the identity matrix Iq with row index in l m \ that is, 1«( P ,q) = 1 if 
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q is the pth element of I m and = 0 otherwise, and thus 1 m extracts the subspace 
spanned by the symbols to be detected in the mth iteration. 

Neglecting decision feedback error and choosing F m and G m according to 
the zero-forcing criterion yields for m = 1 , 2, AM : 

Fm = [Km H K m ] 1 K m H 
Gm ~ H m 

Analogously, choosing F m and G m according to the minimum mean square 
error criterion yields for m = 1 , 2, . . . , M-1 : 

F m = 1w [K m H K m + CJ 2 Jl m 1 ] 1 K m H 

= A m H m H [ K m Jlw K m H + a 2 Iq ]" 1 

G m 11 H m 

In both cases during the last iteration (m = M) all remaining symbols will be 
detected, so J M = 1m, K m = H M , H M = A M - Further, there is no next iteration and 
thus no need for a cancellation matrix Gm; see Figure 1b. The following sections 
4 and 5 provide an explicit example and an extension to compensate for decision 
feedback error. 

The maximum likelihood hard decision for the mth iteration, MLD m , 
depends upon the signal model for r (m) . In particular, 

r (m) = r (m-D_ Gm1§ (m-1) 

- r- Zi<k<m-i Gk s (k) 

= Hs + W - Zu k <m-1 G k s (k) 

where s {k) is the /Vvector of hard estimates of s (k) which is the /Vvector of 
symbols with indices in subset I k . 

Now for the case of no feedback error, G k = H k and s (k) = s (k) , so the model 
becomes 

r (m) = Zm^M Hj s fl) + w 
Thus the soft estimates can be written as: 

x (m) =Fmr (m) 

= F m H m S (m) + Z m+ 1^M F m Hj S a) + F m W 
■ m n m » T im 
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where i m is the residual interference-plus-noise term corresponding to the mth 
iteration detection. Note that the desired signal term, F m H m s (m) , is uncorrelated 
to the residual interference-plus-noise and that 

E[ im im H ] = F m K m +i Jl m (F m K m +i) H + a 2 F m F m H 
For notational convenience define ^ m = E[ i m i m H ] which is a P m x P m matrix. 
Then assuming the residual interference is approximately Gaussian with 
covariance matrix ^m, so log-likelihood metric for MLD m is just the Euclidean 
metric scaled by the covariance matrix: 

s (m) = arg min c(m) || Y m ~ 1 ' 2 (x (m) - F m H m c (m) ) || 2 
where c (m) is a /Vvector of symbols in the product of the P m constellations of 
symbols with indices in I m . Thus the hard decision is found by computing all of 
the metrics and taking the symbol vector which generates the minimum metric; 
and if all of the constellations have the same size |C| (such as all are |C|-QAM), 
then the number of metrics to compute is \C\ Pm . 

Note that for F m and G m defined using the zero-forcing criterion there is no 
residual interference because F m K m+ i = 0. In this case i m is exactly Gaussian. 
For the MMSE criterion, however, some residual interference exists; hence, i m is 
a mixture of Gaussian (from the noise) and non-Gaussian (residual interference) 
random processes. For sufficiently dense symbol constellations and/or a 
sufficiently large number of antennas, the interference can be well approximated 
as Gaussian. Otherwise, a variation of the MLD decision for MMSE-based 
detection is the use of the exact probability density function (pdf) of i m in the 
maximum likelihood detection. Because i m is the sum of F m w and Z m +i<j<M F m Hj 
s 01 , its pdf is the convolution of their pdfs. 

4. 4x4 example 

As an example consider the 4x4 MIMO system with the preferred 
embodiment partitioning the symbols into two subsets of two symbols each; that 
is, P = 0 = 4 with Pi = P 2 = 2 (M = 2). Explicitly, take h = {1 ,2} and h - {3,4}; 
then with h k denoting the Jfcth column of 4x4 channel matrix H, the zero-forcing 
criterion yields: 



TI-34889 Page 10 



Fi = 1 { ,, 2 , [H H H]" 1 H H 

Gi=[h! h 2 ] 
F 2 = [h 3 h 4 ] H 
Analogously, the MMSE criterion gives: 

Fi = 1 { i,2} [H H H + a 2 A-y H H 



(2x4 matrix) 
(4x2 matrix) 
(2x4 matrix) 



(2x4 matrix) 
[Aihih^ + A 2 h 2 h 2 H + a 2 I 4 ]~ 1 



Gi = [hi h 2 ] (4x2 matrix) 

F 2 = [h 3 h 4 ] H (2x4 matrix) 

The simulations of section 9 compare performance of various modifications of 
this 4x4 MIMO system preferred embodiment plus regular iterative detection. 



5. Feedback error compensation 

In general, the error-free decision feedback assumption is sufficiently 
accurate for high signal-to-noise ratio (SNR) and good channel conditions. 
Otherwise, error propagation may occur. When the MMSE criterion is used, the 
effect of error propagation can be taken into account by using the second order 
model which presumes the correlation of the transmitted symbols and their hard 
estimates is expressed by a diagonal PxP energy-correlation matrix: 



E[ss H ] = 



0 



o ^p 2 



0 
0 



0 0 ••• X pPp 

where the symbol-hard estimate correlations pj are generally real-valued and in 
the range -1 < Pj < +1 . Let R denote the PxP diagonal matrix of these 
correlations. Then define the P m x P m diagonal correlation submatrix R m with 
diagonal elements a for k in I m and the (Zm<k<M Pk)x(L mSk zM P k ) diagonal 
correlation submatrix fl m with diagonal elements p k for k in J m . Thus, R m is useful 
for interference cancellation at the end of the roth iteration when the s (m) are 
available, so the p k for k in I m are generally non-zero in R m . Further, fl m is useful 
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for interference suppression at the start of the mth iteration when the s (m, are not 
yet available and so the p k for k in I m are set to zero in forming fl m . All the p k for k 
in J m are not yet available so this is all Os. For multistage detection, some p k will 
be 0 for the undetected symbols in the first stage; during the second stage 
estimates from the previous stage for all the symbols are available and p k is non- 
zero for every k. 

Taking into account the effect of error propagation in the MMSE criterion 
approach leads to modified F m and G m ; for m =1,2,... M-1 : 

F m = 1m [K m H K m (I - 9\ m 2 ) + cr 2 Jl,„ 1 ] 1 K m H 
= A m H m H [ K m (I - fl m 2 )Jl m K m H + a 2 l Q ]" 1 

Gm = H m Rm 

Note that these modified matrices reduce to the prior expressions when p k = 1 for 
previously-detected symbols. In particular, p k = 1 for k in J m - I m reduces F m to 
the prior expression and p k = 1 for k in I m reduces G m to the prior expression. 

This feedback error compensation by use of correlations to limit 
cancellation allows for a multistage detection with each stage a set of M 
iterations as in the foregoing and with the hard estimates and correlations of one 
stage used as initial conditions for the next stage. In this manner, performance 
can be improved in stages by controlling the cancellation according to current 
correlation. Of course, the correlations must be computed, either by ensemble 
averaging (i.e., use a pilot channel or a set of training symbols that undergo the 
same MIMO channel as the information symbols) or by use of an analytical 
model (e.g., specific symbol constellation and Gaussian approximations) which 
can relate correlations and SINRs (which are used to define the F and G 
matrices) plus provide convergence to limiting values through successive stages. 

In particular, if superscript/subscript [n] denotes nth stage variables, then 
the first stage would be as described in the preceding paragraphs: the mth 
iteration would include: 

r [1](m) = r [1K^)_G [1]m -iS [1l(m - 1) 

= r-Zi^m-iG mk s [1](k) 
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r[l]m 


= A m H m H f Km (1 - flMlm 2 )Jlm Km" + CF 2 <Q ] 


x [1](m) 


= F [1]m r™ m > .. 


§[1](m) 


= MLD m {x I1](m) } 


G[i] m 


= H m R[i]m 



Then the second stage would have first stage hard estimates of interfering 
subsets of symbols to include in the cancellations, so 

r [2](m) = jRKm-1) _ §[2l(m-D 

= r - Zi, ksm -1 G [2]k §®< k) - Ln*i*M G m s [1](j) 
Note that the hard estimates from the first stage for the subset of symbols being 
detected, s [1](m) , are not part of the cancellation. And the correlation matrices 
flpjm can use first stage correlations p^ ]k for k in J m , whereas Rpjm is used after 
the /nth iteration and will have correlations f\ 2 ]k for k in J m . And so forth for 
subsequent stages. 

6. Ordered detection 

Ordered detection based on the symbol post-detection signal-to- 
interference-plus-noise ratio (SINR) is often used to further improve performance, 
error. Selecting the detection ordering in the foregoing preferred embodiments 

amounts to selecting the index sets h,h Im that form a partition of {1 ,2 P}. 

One may select a pre-determined detection ordering that does not depend on the 
channel realization H. Analogous to the conventional iterative detection, further 
improvement can be obtained by selecting the ordering as a function of H based 
on a selected performance measure as in the following examples. 

(1) Pre-ML SINR measures 

At the start of the mth iteration, compare the (I„<*<a^*) SINR values 
associated with the index set J m , and select index set I m to label the P m largest 
SINR values. Note that closed form expressions for the SINR as a function of 
the channel realization can be obtained for different detection criteria. In 
particular, define the SINR associated with symbol s p at the mth iteration as y p (m) . 
For the zero-forcing criterion this yields for p e I m : 
Y P < m) = Xp/a 2 [(K m H K m r 1 ]p,p 
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Whereas the MMSE criterion together with the feedback error compensation 
yields for p e I m \ 

Yp (m) = A, p h p H [Tnej m+i A, n (1-Ai 2 )h n h n H + Zke/^-p ^khkh k H + a 2 Iq ]" 1 h p 
= / [a 2 JV 1 (K m H K m ) (I - fl m 2 ) + a 2 Jl w " 1 )- 1 ] PiP - 1 
where h k is the kth column of H, and the case of error-free feedback is a special 
case. 

(2) Post-ML SINR measures 

Intuitively, better detection ordering can be obtained when a measure of 
SINR after the ML detection. Several possible post-ML detection measures can 
be derived from the symbol error rate (SER) expression of ML detection. Starting 
from the ML metric above, scale by W m ~ V2 to define: 

y(m) ^ xj/^ -1/2 x (m) 
©m = 1/2 Fm K m 

S def XU -1/2 : 

Jm = i m «m 

Then 

y(m) = -1/2 x (m) 

= ^m' 1/2 (F m K m s (m) + i m ) 

= T m " 1/2 0 m S (m) + j m 

Here the scaled residual interference-plus-noise term, j m , is white with E[ j m j m H ] 
= I. Let .£ w denote the /^-dimensional product of constellations of the symbols 
with indices in I m \ that is, $ m = Y\ k€ i m C k where C k is the symbol constellation for s* 

and the product is over k in I m . Also let N m denote the size of £ w , so N m = rW. 
\C k \. An upper bound of symbol error rate (SER) corresponding to MLD m can be 
derived as follows: 

SER m = Pr(s< m ) * s (m) ) 

= (VN m ) I«e 5m Pr(s< m > * s (m) | s (m) = ex) 

= (1/ N m ) I« e5m Z &1 &*« Pr(s<™> = ex | s (m) = <x) 
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= (1/ N m ) I« e§m Z«, **« Pr<rv* {||y (m) -0 m (x|| 2 <||y (m) -0mPII 2 }) 

< (MN m ) Ic, e§m Z«, &*a Pr(||y (m) -0 m 6(|| 2 <||y (m) -© m a|| 2 ) 
=(1/ A^m) S« 6 § m Z&. «*« Q( V(||0m(«-(x)|| 2 12)) 

where the Q function is the area under the Gaussian tail. This upper bound on 
SER m can be used to select the detection order: take I m to minimize the upper 
bound. Thus this measure is post-ML based on the SER upper bound. 

A looser upper bound can be derived from the fact that for a,o(e^: 

||0m(«-cx)|| 2 > mini<,</> m (||[0 m ],,|| 2 6 2 minJ ) 
where d 2 m; „,, is the minimum distance among symbols in the symbol constellation 
associated with the z'th column of © m , [0m].,/. Thus, 

SER m ^ (1/ N m ) Hoi£$m Za, a*a Q(V(||0m(O(-«)|| 2 /2)) 

< Q(V( minW||[0 m ],,|| 2 d 2 m ,„,, ) )) 

When the same constellation (modulation scheme) is used for all of the transmit 
antennas, the minimum distance is common and the upper bound simplifies to: 

SER m < (Wm-1 ) Q(V( d 2 m ,„ mini s <p m (ll[®mLH 2 ) )) 
This upper bound yields the following post-ML measure for selecting the 
detection order: at the mVn iteration, select / TO to maximize mini<,<p m (||[0 m ].,,|| 2 ). 
This measure is thus post-ML based on the channel norm after MMSE 
transformation. 

7. Decision alternatives 

The foregoing preferred embodiments hard decision methods, MLD m , 
convert x k (m) into s k (m) use a maximum likelihood approach which applies a hard- 
limiter (hard-clipping) non-linearity. Alternative non-linearities, generically 

denoted as s (m) = D{x (m) }, such as hyperbolic tangent, may result in better 
performance. In this case the decision statistics that are close to zero are 
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effectively scaled instead of clipped; the scaling depends upon the variance of 
the residual interference plus noise. For example, for a single BPSK symbol: 

Hyperbolic tangent: Dtanh(z) = tanh(Re{z}/^) 

Soft linear clipped: Dsl(z) = +1 ifRe{z}>^ 

= -1 if Re{z} < 

= Re{z} if | Re{z} | < 4 
where 4 is a non-negative constant. For higher order modulation, the decision is 
done in the bit level following the bit log likelihood ratio computation (as 
described in Section 8). Alternatively, for QPSK, the real and imaginary parts of : 

D(z) can be taken as tanh(Re{z}/£) and tanh(lm{z}/£), respectively; or the 
corresponding soft linear with Re{z} and lm{z}, respectively. Further, for higher 
order (larger constellations) the soft linear extends by selecting the constant to 

make D(z) = z for z in the constellation. And for two or higher-dimensional 
constellations, extend by components. For example, with a subset of symbols of 
size P m = 2, the ML decision on the soft estimate 2-vector x (m) gives the 2-vector 
of symbols in the product constellation closest to x (m) ; whereas, the soft linear 

would give the 2-vector [ T>s L (xi (m) ), D S L(x 2 (m) )]. 

While any of those non-linear estimators can be used, the optimal soft 
estimator is the conditional expectation estimator (analogous to Section 2), which 
is 

X«»pf-K"" 2 ( x< "'- F « H " c l 2 ) 

S<«> = £ i<"» | X (») =_« v _L 

2exp(-|^-" 2 (x'»'-F„H„c| 2 ) 

where the summation is over the set formed by the product of P m constellations 
of symbols with indices in l m (see discussion at the end of Section 3). 

For systems with large constellation and/or a number of antennas, the 
computational burden may be prohibitive as it increases exponentially with the 
number of transmit antennas. In this case, sphere decoding type algorithm can 
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be used to reduce computational complexity at the expense of some 
performance (while retaining most of the gain of maximum likelihood detector). 

8. Decoding 

The demodulator converts the soft symbol estimates x (;) , x {2) , .... x {M) 
output by an M-iteration detector into conditional probabilities; and the conditional 
probabilities translate into bit-level log likelihood ratios (LLRs) for (sequence) 
decoding. In more detail, the LLRs in terms of the bits u pk which define the 
constellation symbols s p (e.g., two bits for a QPSK symbol, four bits for a 16QAM 
symbol, etc.) are defined (dropping the superscripts on the x p ) as 

LLR(Up k ) = log{ P[u pk = 1 | x pk ] / P[u pk = 0 | x pk ] } 

= log{ P[u pk = 1 | x pk ]} - log{ P[u pk = 0 | x pk ] } 
The LLRs can be computed using a channel model. For example, 

LLR(u pk ) = log{ P[u pk = 1 | x p ] / P[u pk = 0 | x p ] } 

= log{p(x p | Upk = 1 ) / p(x p | u pk = 0)} + log{P[u pk = 1]/P[u nk =0]} 
where the first log term includes the probability distribution of the demodulated 
symbol x p which can be computed using the channel model. The second log 
term is the log of the ratio of a priori probabilities of the bit values and typically 
equals 0. So for an AWGN channel where the residual interference (interference 
which is not cancelled) is also a zero-mean, normally-distributed, independent 
random variable, the channel model gives: 

p(x p | s p = c ) ~ exp( - |x p - c| 2 / y p ) 
where c is a symbol in the symbol constellation and y p is a normalization typically 
derived from the channel characteristics and the detector type. Indeed, the 
following section describes known normalizations and preferred embodiment 
normalizations. Of course, y p is just twice the variance of the estimation error 
random variable. 

Then compute LLRs by using an approximation which allows direct 
application of the channel model. Take p(x p | u pk = 1) = p(x p | s p = c pk =i ) where 
c pk =i is the symbol in the sub-constellation of symbols with kth bit equal 1 and 
which is the closest to x p ; that is, c pk =i minimizes |x p - Ck=i | 2 for Ck=i a symbol in 



TI-34889 Page 17 



the sub-constellation with kth bit equal to 1 . Analogously for p(x p | u pk = 0) using 
the sub-constellation of symbols with kth bit equal 0. Then with equal a priori 
probabilities of the bit values and the notation subscript k=1 and k=0 indicating 
symbols with kth bit 1 and 0, respectively, the approximation yields 
LLR(Up k ) = log{ p(x p | u pk = 1) / p(x p | u pk = 0) } 

s 1/y p . {min k=0 |x p - c k=0 | 2 - min k=1 |x p - c k=1 | 2 } 
Thus the LLR computation just searches over the two symbol sub-constellations 
for the minima. 

The LLRs are used in decoders for error correcting codes such as Turbo 
codes (e.g., iterative interleaved MAP decoders with BCJR or SOVA algorithm 
using LLRs for each MAP) and convolutional codes (e.g. Viterbi decoders). Such 
decoders require soft bit statistics (in terms of LLR) from the detector to achieve 
their maximum performance (hard bit statistics with Hamming instead of 
Euclidean metrics can also be used but result in approximately 3dB loss). 
Alternatively, direct symbol decoding with LLRs as the conditional probability 
minus the a priori probability could be used. Figures 4a-4b illustrate the 3GPP 
Turbo encoder and an iterative decoder which includes two MAP blocks, an 
interleaver, a de-interleaver, and feedback for iterations. 

9. Simulations 

Simulations for 4x4 MIMO QPSK and 16QAM systems with P = Q = 4 in 
spatially uncorrelated (IID) and correlated MIMO channels were performed to 
compare a preferred embodiment with other detection methods. The correlated 
spatial channel profile has the average and peak cross correlation of 0.16 and 
0.40, respectively. Figures 3a-3d compare simulations with the following six 
detection methods including four preferred embodiment iterative reduced state 
ML variations: 

(1) IMMSE UNORD: regular iterative MMSE without detection ordering. 

(2) IMMSE ORD: regular iterative MMSE with detection ordering based on 
post-detection SINR. 

(3) IRSML UNORD: MMSE-based iterative reduced state ML for M= 2, Pi 
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= P 2 = 2 and without ordering. 

(4) IRSML ORD Pre-ML SINR: MMSE-based iterative reduced state ML 
for M = 2, Pi = P 2 = 2 and with ordering based on SINR after MMSE 
transformation. 

(5) IRSML ORD Post-ML Norm: MMSE-based iterative reduced state ML 
for M=2,Pi=P 2 = 2 and with ordering based on channel norm after MMSE 
transformation. 

(6) IRSML ORD Post-ML SER: MMSE-based iterative reduced state ML 
for M = 2,P< i =P 2 = 2 and with ordering based on SER bound after ML detection. 

The raw bit error rate (BER) versus signal-to-noise ratio (Eb/N 0 ) results for 
QPSK in IID and correlated channels appear in Figures 3a-3b, respectively, and 
for 16QAM in IID and correlated channels appear in Figures 3c-3d, respectively. 
Observe that the gain of IRSMLover regular IMMSE is more dramatic in spatially 
correlated channels and when no detection ordering is used. Notice also that 
IRSML with SER-based detection ordering results in a gain of 10-dB for 16QAM 
in correlated channels at BER = 1(T 3 compared to the regular IMMSE with 
detection ordering. 

The computational complexity of preferred embodiment iterative reduced 
state ML methods can be compared to the full ML and the regular MMSE 
methods. In particular, consider two aspects: matrix inversion computation and 
hard decision estimation. For comparison, assume that the preferred 
embodiments use equal size subsets (P m = P/M) and all methods use the same 
constellation (size C) for all symbols. Then the following table indicates the 
complexity of the methods. 



Detector 


Matrix inversion 


Hard decision 


Regular iterative 


-PxF 3 


~PxC 


Maximum likelihood 


none 


~(? 


IRSML 


-MxP 5 





Thus for the preferred embodiments with two subsets (M = PI2 and P m =2), 
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the computational complexity of the preferred embodiments is comparable to the 
regular iterative detector for small to moderate constellation size. For large 
constellation size (such as 64-QAM or 256-QAM), the complexity increase for the 
preferred embodiments over the regular iterative detector becomes significant. 
However, its performance gain is expected to become more dramatic as 
indicated by the simulations. 
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